Value functions for depth-limited solving in zero-sum imperfect-information games

نویسندگان

چکیده

We provide a formal definition of depth-limited games together with an accessible and rigorous explanation the underlying concepts, both which were previously missing in imperfect-information games. The works for arbitrary (perfect recall) extensive-form game is not tied to any specific game-solving algorithm. Moreover, this framework unifies significantly extends three approaches solving that existed multiagent reinforcement learning but known be compatible. A key ingredient these value functions. Focusing on two-player zero-sum games, we show how obtain optimal functions prove public information provides necessary sufficient context computing them. domain-independent encoding domains allows approximating even by simple feed-forward neural networks, are then able generalize unseen parts game. use resulting network implement version counterfactual regret minimization. In distinct domains, algorithm's exploitability roughly linearly dependent network's quality it difficult train CFR's performance as good CFR access full

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Iterative Algorithm for Solving Two-player Zero-sum Extensive-form Games with Imperfect Information

We develop and evaluate a new exact algorithm for finding Nash equilibria of two-player zero-sum extensive-form games with imperfect information. Our approach is based on the sequenceform representation of the game, and uses an algorithmic framework of double-oracle methods that have been used successfully in other classes of games. The algorithm uses an iterative decomposition, solving restric...

متن کامل

Solving two-person zero-sum repeated games of incomplete information

In repeated games with incomplete information, rational agents must carefully weigh the tradeoffs of advantageously exploiting their information to achieve a short-term gain versus carefully concealing their information so as not to give up a long-term informed advantage. The theory of infinitelyrepeated two-player zero-sum games with incomplete information has been carefully studied, beginning...

متن کامل

Limited Lookahead in Imperfect-Information Games

Limited lookahead has been studied for decades in complete-information games. We initiate a new direction via two simultaneous deviation points: generalization to incomplete-information games and a game-theoretic approach. We study how one should act when facing an opponent whose lookahead is limited. We study this for opponents that differ based on their lookahead depth, based on whether they,...

متن کامل

Hustling in Repeated Zero-Sum Games with Imperfect Execution

We study repeated games in which players have imperfect execution skill and one player’s true skill is not common knowledge. In these settings the possibility arises of a player “hustling”, or pretending to have lower execution skill than they actually have. Focusing on repeated zero-sum games, we provide a hustle-proof strategy; this strategy maximizes a player’s payoff, regardless of the true...

متن کامل

Endgame Solving in Large Imperfect-Information Games

The leading approach for computing strong game-theoretic strategies in large imperfect-information games is to first solve an abstracted version of the game offline, then perform a table lookup during game play. We consider a modification to this approach where we solve the portion of the game that we have actually reached in real time to a greater degree of accuracy than in the initial computa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Artificial Intelligence

سال: 2023

ISSN: ['2633-1403']

DOI: https://doi.org/10.1016/j.artint.2022.103805